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Abstract — Puncturing is a well-known coding technique widely 
used for constructing rate-compatible codes. In this paper, we 
consider the problem of puncturing low-density parity-check 
codes and propose a new algorithm for intentional puncturing. 
The algorithm is based on the puncturing of untainted symbols, 
i.e. nodes with no punctured symbols within their neighboring 
set. It is shown that the algorithm proposed here performs better 
than previous proposals for a range of coding rates and short 
proportions of punctured symbols. 

Index Terms — low-density parity-check codes, intentional 
puncturing, short-length codes. 



I. Introduction 

Low-density parity-check (LDPC) codes are considered 
fixed-rate ciiannel codes since they incorporate a fixed amount 
of redundant information H]. However, there exist some well- 
known techniques for adapting the coding rate of a linear 
code, one of them is puncturing. A linear code is punctured 
by deleting a set of symbols in a codeword. 

When puncturing a code, we must differentiate between 
random and intentional puncturing. In the former, punctured 
symbols are randomly chosen, whereas in the latter, the code is 
analyzed to select the set of symbols to puncture. The asymp- 
totic performance of random and intentional punctured LDPC 
codes was analyzed in O, and puncturing thresholds were 
also identified in [4J. Some other methods delved into the 
code structure to identify good puncturing patterns ||5], B, 
or examined its graph construction lEl, Jll to facilitate its 
puncturing ll9l-|[T2l. 

The objectives pursued when switching from random punc- 
turing can lead to different solutions. In particular, algorithms 
that focus in covering a wide range of coding rates do not 
offer the best performance for small puncturing proportions 
and vice-versa. Several algorithms find puncturing patterns 
that allow to cover a wide range of rates Q, lEI, lfT3l - llT5l . 
However, when working with short length codes, but also in 
other scenarios, the ensemble of punctured symbols determines 
the decoding performance. In this work, we describe an 
algorithm that we call untainted. Its main focus is optimizing 
the decoding of moderately punctured codes. 

The rest of this paper is organized as follows. In Section HI] 
we introduce the notation, some puncturing properties and 
the untainted algorithm. In Section |III1 we present simulation 
results over several channels, and compare them with the 
results in previous studies ||5], 0. Conclusions are presented 
in Section HV] 
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II. Untainted Puncturing 

A. Notation and Definitions 

Error correcting codes can be represented by bipartite 
graphs linking symbol nodes with check nodes. LeQ N{z) 
denote the neighborhood of a node z, that is, the set of nodes 
adjacent to z. The degree of a node is defined as the cardinality 
of its neighborhood. This concept can be extended to include 
all the nodes reachable from z by traversing a maximum of k 
edges, we call this set of nodes N^{z) the neighborhood of 
depth k of z. 

Let V stand for the set of punctured symbols, v <E P belongs 
to the set of 1-step extended recoverable symbols (v G TZi) 
if 3c G Af{v) such that e Af{c)\{v}, w ^ V. For n > 1 
we recursively define the sets of fc-step extended recoverable 
symbols Rk. We say that a punctured symbol v ^ TZi U ... U 
TZk-i belongs to the set of fc-step extended recoverable (v G 
TZk) symbols if 3c € J^{v) and 3w G J\f{c)\{v} such that 
w G 7^fe-l and Vw' G Af {c)\{v , w} , w' G "P ^ w' G 7^l U 
...U7^fc_l. 

The graph subjacent to A/'^'^(f), v G TZk, is assumed to be 
tree-like. Let u be a node in J\f^''{v). We denote by 
the neighborhood of u of depth t restricted to the descendants 
of u in this tree. Note that Afi{u) — J\f_i^{u). We can prune it 
by eliminating the connection of any symbol w eTZi with a 
check c G Mi{w) if max„/gjv'^(c) {m\w' G TZm} > I — I. We 
call this graph %, the extended recovery tree of v. 

We consider in this letter the sum-product decoding algo- 
rithm. The algorithm exchanges messages representing prob- 
abilities or the log-likelihood ratio (LLR) of probabilities. 
The messages are iteratively exchanged from symbol to check 
nodes and from check to symbol nodes. If the decoding graph 
is tree-like then, for uniform sources and output symmetric 
channels, sum-product decoding is equivalent to maximum a 
posteriori decoding and the decoder minimizes the decoding 
error |]16]. 

Let us analyze the effect of puncturing on the messages 
exchanged in the sum-product algorithm. For every v ^ V 
the decoding algorithm has no information on the value that v 
takes. In consequence, for binary input channels it can take 
both values (one and zero) with probability one half. On 
the LLR version of the sum-product algorithm the outgoing 
messages from symbol v on iteration one are equal to zero. 

A check c G N{v) in the neighborhood of a punctured 
symbol v is called a survived check node if 3w G N:^{c)\w G 
TZk-i, and a dead check node otherwise. The message that a 
dead check node c sends to u is a zero LLR. A punctured 

'We follow a notation similar to that in Ha et al. 
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symbol is recovered when it receives a message from a 
survived check node. It follows from the previous definitions 
that a symbol in TZk is recovered after k decoding iterations. 

We define the extended recovery error probability of a 
punctured symbol Pe{v) as the decoding error probability of 
V G TZk after k iterations on Tv Assuming that a codeword 
is sent through a binary input memoryless output symmetric 
channel the probability of error is independent of the codeword 
sent [16 1 . Then Pe{v) is the probability that v takes the value 
one conditional to sending the all zero codeword. 

These concepts, are extended in the sense that they general- 
ize the definitions introduced by Ha et al. in [5J. The difference 
between both sets of definitions is that every punctured symbol 
is connected to only one survived check node in the (non- 
extended) recovery tree in [5 | and, in consequence, the (non- 
extended) recovery error probability is given by the probability 
that the message sent by the survived node is wrong. 

B. Properties of the extended recovery error probability 

Ha considered the exact recovery error probability over the 
(non-extended) recovery tree for the binary erasure channel 
(BEC), the additive white Gaussian noise (AWGN) channel 
and the binary symmetric channel (BSC). This error probabil- 
ity is a monotone increasing function on the number of nodes 
in the recovery tree. The algorithm in [31 was developed to 
exploit this non-intuitive property. 

The single survived check node assumption captures the tree 
structure when a high proportion of symbols are punctured. 
However, for a low proportion of punctured symbols there 
can be more than one survived check node. We now show 
that having more than one survived check node is a desirable 
property. More precisely, adding a survived check node in 
an extended recovery tree can not increase Pe{v). We first 
prove a stronger claim on the BEC, i.e. adding a survived 
check node decreases Pe{v). Then, we prove the property for 
general symmetric channels. The idea behind the general proof 
is that we can reduce the number of survived check nodes by 
adding noise to the symbol nodes under a survived check node. 
Then, given that the sum-product algorithm on tree like graphs 
with uniform priors is equivalent to a maximum a posteriori 
estimation, the decoding on the noisier tree can not reduce the 
decoding error probability. 

Theorem 1: Let Z, fc G N and Q < I < k. Now consider 
the subgraph of a check node z of depth 2fc — 2/ — 1 such 
that maXj„£^(-2) {ra\'w G TZm} ~ k ~ I. Let Tvi,Tv2 be the 
extended recovery trees associated with punctured symbols 
vi,V2 G TZk- Let both trees be identical except for some 
X G A/'^'(w2) that is linked with z. Then the recovery 
error probability of vi and V2 sent through a BEC(a), with 
< a < 1, verify: 

P,{vi) > PeM- (1) 

Proof The initial erasure probability of a symbol v is: 
" \ a otherwise 



further, e can be recursively defined for any symbol and check 
in the tree from its children erasure probabilities: 

£.=4°^ n (2) 

ec = l~ n (1-^")- (3) 

Now, taking into account that there are no punctured sym- 
bols within the leave nodes by the definition of the extended 
recovery tree, it holds that el*^^ < 1 for the leaf symbols of the 
tree spanning from check z. It follows by induction that: 1) 
Vv,c G 7V;^''"^'"^(2;), e„, ec < 1; which implies that < 1, 
and 2) if we attach a check node z with < 1 to a symbol 
X then ei,j > ei,^ . Finally, the recovery error probability of a 
symbol node v is Pe{v) ~ et,/2, which completes the proof. 

■ 

Theorem 2: The recovery error probability of vi and V2 
over the trees Tv^ , Tv2 defined exactly as in Th. [T] and 
sent through any binary input symmetric output memoryless 
channel C, verify: 

Pe{vi) > Pe{v2). (4) 

Proof: For a precise characterization of Pg in the general 
setting, we need to track a message density instead of a scalar 
The initial density function of a non-recovered punctured node 
takes the form of the Dirac delta function D{y) = S{y), since 
a punctured node transmits a zero LLR with probability one. 
Let the remaining nodes in Tv^ , have initial densities given 
by Po{y), the initial LLR density associated with channel C. 

Now consider a second scenario for ■ We associate every 
leaf node in A/'^'^~^'~^ (z) with samples from D{y), which 
is equivalent to puncturing these nodes. If we puncture the 
leave nodes, their parent check nodes do not become survived 
check nodes and the symbols w G A/J'^~^''~^{z)\w G 'P are 
not recovered. It follows by induction that z remains a dead 
check node, i.e. associating the leaves with D{y) is equivalent 
to eliminating the edge joining z to x. In consequence, the 
density of messages reaching the root node V2 in the second 
scenario is identical to the density of messages reaching vi in 
the first scenario. 

The (binary output) degenerate channel D, with initial 
density D{y), transforms any input into a one or a zero with 
equal probability, i.e. p£i{l\x) — po{{)\x) — 0.5. Let Q 
represent the concatenated channel of C with D: 

PQ{y'\x) = ^PD{y'\y)pc{y\x) 

yey 

= 0.5^pc{y\x) = pD{y'\x). (5) 

In other words, D can be regarded as the concatenation of 
C with itself, and the samples from D{y) are stochastically 
degraded samples of Po(y) [17|. 

Following the argument in flSl Th. 5], Eq. (|5]l implies that 
Pe{vi) > Pe{v2)- The assertion follows from the fact that the 
estimate of both vi and V2 are maximum likelihood estimates. 
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C. Untainted Puncturing Algorithm Description 

We introduce the concept of untainted, to propose a simple 
method that chooses symbols such that all the check nodes of a 
selected symbol are survived nodes. This restriction guarantees 
that more than one survived check node is associated with 
every punctured symbol. 

Definition 1: A symbol node v is said to be untainted if 
there are no punctured symbols within Af'^{v). 

Let X^o be the set of untainted symbol nodes. Initially, when 
there are no punctured symbols, X^o consists of every symbol 
node. 

{Initialize} Xoo = {1, n}, p = 1. 
while A'oo ^ do 

{Step 1.- Look for candidates] 

Make the set of candidates 51, which is a subset of X^o, 
such that u e if W^{u)\ < \J\f^iv)\ for any v G Xqq. 
{Step 2.- Select for puncturing] 

Pick a symbol node v^^^ from (pick one randomly if 
there exist more that one symbols in Vl). 
{Step 3.- Update the set of untainted symbols} 
X^^X^\N^{v) 
p = p+l 
end while 

The algorithm obtains a set of puncturable symbol nodes 
consisting in the symbols selected in the second step. It 
concludes when Xqo ~ 0, i-C- there is no untainted symbols. 
The range of values for p, the number of punctured symbols, 
can be found empmcally by simulations (see Table 

Note that for codes with an almost regular check node 
degree distribution, the searching criterion in Step 1 can be 
simplified: instead of looking for a symbol with the smallest 
neighboring set of depth 2, the algorithm can look for symbols 
with the lowest degree. 

III. Simulation Results 

The untainted algorithm ensures that the extended recovery 
tree of a punctured symbol has more than one survived check 
node. In this section, we construct codes to show that the 
untainted algorithm yields a better performance in terms of 
the frame error rate (FER). We have constructed 10^ bit- 
long irregular LDPC codes of different coding rates for the 
BEC, the BSC and the AWGN channels. The polynomials for 
the BSC with rates (0.5,0.6,0.7,0.8) have been drawn from 
09]. The remaining generating polynomials, as well as all the 
matrices used can be checked in [20] . 

Figs. [T] |2] and [3] compare the performance of intentional 
punctured codes using the untainted algorithm with those in jS) 
and in |6 1 for different proportions of punctured symbols tt. All 
codes are decoded with 200 iterations using the sum-product 
algorithm llT6l over the graph of the mother (non-punctured) 
code. 

As in Ha et al. [5| we used three random seeds for 
the simulations of each intentional puncturing algorithm, the 
results in the figures show the intermediate performer. 

The untainted punctured LDPC codes outperform the codes 
punctured following the algorithms in |I5] and in O for all the 
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Fig. 1. FER over the AWGN as a function of the signal-to-noise ratio 
(SNR). Two LDPC codes with coding rates i?o = 0.5 and Rq = 0.6, and 
two different proportions of punctured symbols, tt = 5% and tt = 10% were 
used. 
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Fig. 2. FER over the BEC with crossover probability e for different inten- 
tional puncturing strategies. Two LDPC codes with coding rates Rq =0.5 
and Ro = 0.6, and two different proportions of punctured symbols, tt = 5% 
and TT = 10% were used. See inset in Fig. [T]for symbols and line-styles. 

coding rates, channels and puncturing proportions considered. 
For a FER of 10^"^ the strongest improvements appear in the 
BSC for a mother code of rate Rq = 0.3 punctured a 10%, 
in the BEC for a mother code of rate Rq — 0.6 punctured a 
5% and in the AWGN for a mother code of rate Rq — 0.6 
punctured a 10% respectively. 

Table U shows Pmin and Pmax, the minimum and maximum 
values of p, respectively, after 5 • 10"^ algorithm runs. Sizes are 
computed for both the untainted algorithm and the algorithm 
in ||5|. The values are computed for the same codes used 
in Fig. [3] and two additional codes of rates 0.7 and 0.8. 
The table shows that the untainted algorithm can puncture a 
smaller number of symbols compared to [5 1. This behavior is 
consistent with the additional number of survived check nodes 
required by the untainted algorithm. 

IV. Conclusions 

We proved that having more than one survived check node 
in the extended recovery tree is a desirable property. The 
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Fig. 3. FER over the BSC with crossover a probability £ for several LDPC codes with coding rates Rq = 0.3, Rq = 0.4, iJo = 0-5 and Rq = 0.6, and 
two different proportions of punctured symbols, tt = 5% and tt = 10%. 



TABLE I 
Number of Punctured Symbols 

Coding rate (mother code) 







0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


Ref . 15J 


Pmin 


4628 


4031 


3439 


2866 


2258 


1581 




Pmax 


4753 


4139 


3552 


2983 


2353 


1657 


Untainted 


Pmin 


2603 


2286 


1909 


1587 


1212 


851 




Pmax 


2686 


2374 


1987 


1655 


1273 


901 



untainted algorithm is a method that chooses symbols such 
that all the check nodes of a selected symbol are survived 
check nodes. Furthermore, the algorithm can be implemented 
with a low computational cost. 

Simulation results show that the performance of the un- 
tainted algorithm in terms of the FER is better than the 
best intentional puncturing algorithms in the literature for a 
range of coding rates, channels and puncturing proportions. 
The drawback of using this algorithm is a reduction in the 
maximum achievable rate since the proportion of puncturable 
symbols is limited by the untainted criterion. 
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